FIELD OF THE INVENTION

The present invention relates to optimizing DNA sequences in order
to (a) improve the properties of a protein of interest by artificial
generation of genetic diversity of a gene encoding the protein of
interest by the use of the so-called gene- or DNA shuffling
technique to create a large library of "genes", expressing said
library of genes in a suitable expression system and screening the
expressed proteins in respect of specific characteristics to
determine such proteins exhibiting desired properties or (b) improve
the properties of regulatory elements such as promoters or
terminators by generation of a library of these elements,
transforming suitable hosts therewith in operable conjunction with a
structural gene, expressing said structural gene and screening for
desirable properties in the regulatory element.



BACKGROUND OF THE INVENTION

It is generally known that a protein performing a certain
bioactivity exhibits a certain variation between genera, and that
differences may exist even between members of the same species. This
variation is of course even more outspoken at the genomic level.

This natural genetic diversity among genes coding for proteins
having basically the same bioactivity has been generated in Nature
over billions of years and reflects a natural optimization of the
proteins coded for in respect of the environment of the organism in
question.

In today's society the conditions of life are vastly removed from
the natural environment and it has been found that the naturally
occurring bioactive molecules are not optimized for the various uses
to which they are put by mankind, especially when they are used for
industrial purposes.

It has therefore been of interest to industry to identify such
bioactive proteins that exhibit optimal properties in respect of the
use for which they are intended.

This has for many years been done by screening of natural sources,
or by use of mutagenesis. For instance, within the technical field
of enzymes for use in e.g. detergents, the washing and/or
dishwashing performance of e.g. naturally occurring proteases,
lipases, amylases and cellulases has been improved significantly by
in vitro modifications of the enzymes.

In most cases these improvements have been obtained by site-directed
mutagenesis resulting in substitution, deletion or insertion of
specific amino acid residues which have been chosen either on the
basis of their type or on the basis of their location in the
secondary or tertiary structure of the mature enzyme (see for
instance US patent no. 4,518,584).

In this manner the preparation of novel polypeptide variants and
mutants, such as novel modified enzymes with altered
characteristics, e.g. specific activity, substrate specificity,
thermal, pH and salt stability, pH-optimum, pI, Km, Vmax etc., has
successfully been performed to obtain polypeptides with improved
properties.

For instance, within the technical field of enzymes the washing
and/or dishwashing performance of e.g. proteases, lipases, amylases
and cellulases has been improved significantly.

An alternative general approach for modifying proteins and enzymes
has been based on random mutagenesis, for instance, as disclosed in
US 4,894,331 and WO 93/01285.

As it is a cumbersome and time consuming process to obtain 
polypeptide variants or mutants with improved functional prop- 
erties a few alternative methods for rapid preparation of modi- 
fied polypeptides have been suggested. 

Weber et al., (1983), Nucleic Acids Research, vol. 11, 5651,
describes a method for modifying genes by in vivo recombination
between two homologous genes. A linear DNA sequence comprising a
plasmid vector flanked by a DNA sequence encoding alpha-1 human
interferon in the 5'-end and a DNA sequence encoding alpha-2 human
interferon in the 3'-end is constructed and transfected into a rec A
positive strain of E. coli. Recombinants were identified and
isolated using a resistance marker.

Pompon et al., (1989), Gene 83, p. 15-24, describes a method for
shuffling gene domains of mammalian cytochrome P-450 by in vivo
recombination of partially homologous sequences in Saccharomyces
cerevisiae by transforming Saccharomyces cerevisiae with a
linearized plasmid with filled-in ends, and a DNA fragment being
partially homologous to the ends of said plasmid.

In WO 97/07205 a method is described whereby polypeptide variants
are prepared by shuffling different nucleotide sequences of
homologous DNA sequences by in vivo recombination using plasmidic
DNA as template.

US patent no. 5,093,257 (Assignee: Genencor Int. Inc.) discloses a
method for producing hybrid polypeptides by in vivo recombination.
Hybrid DNA sequences are produced by forming a circular vector
comprising a replication sequence, a first DNA sequence encoding the
amino-terminal portion of the hybrid polypeptide, a second DNA
sequence encoding the carboxy-terminal portion of said hybrid
polypeptide. The circular vector is transformed into a rec positive
microorganism in which the circular vector is amplified. This
results in recombination of said circular vector mediated by the
naturally occurring recombination mechanism of the rec positive
microorganism, which include prokaryotes such as Bacillus and E.
coli, and eukaryotes such as Saccharomyces cerevisiae.

One method for the shuffling of homologous DNA sequences has been
described by Stemmer (Stemmer, (1994), Proc. Natl. Acad. Sci. USA,
Vol. 91, 10747-10751; Stemmer, (1994), Nature, vol. 370, 389-391).
The method concerns shuffling homologous DNA sequences by using in
vitro PCR techniques. Positive recombinant genes containing shuffled
DNA sequences are selected from a DNA library based on the improved
function of the expressed proteins.

The above method is also described in WO 95/22625. WO 95/22625
relates to a method for shuffling of homologous DNA sequences. An
important step in the method described in WO 95/22625 is to cleave
the homologous template double-stranded polynucleotide into random
fragments of a desired size followed by homologously reassembling
the fragments into full-length genes.

A disadvantage inherent to the method of WO 95/22625 is, however,
that the diversity generated through that method is limited due to
the use of homologous gene sequences (as defined in WO 95/22625).

Another disadvantage in the method of WO 95/22625 lies in the
production of the random fragments by the cleavage of the template
double-stranded polynucleotide.

A further reference of interest is WO 95/17413 describing a method
of gene or DNA shuffling by recombination of specific DNA sequences
- so-called design elements (DE) - either by recombination of
synthesized double-stranded fragments or recombination of PCR
generated sequences to produce so-called functional elements (FE)
comprising at least two of the design elements. According to the
method described in WO 95/17413 the recombination has to be
performed among design elements that have DNA sequences with
sufficient sequence homology to enable hybridization of the
different sequences to be recombined.

WO 95/17413 therefore also entails the disadvantage that the
diversity generated is relatively limited. Furthermore the method
described is time consuming, expensive, and not suited for
automatisation.

Despite the existence of the above methods there is still a need for
better iterative in vitro recombination methods for preparing novel
polypeptide variants. Such methods should also be capable of being
performed in small volumes, and amenable to automatisation.

SUMMARY OF THE INVENTION

The present invention concerns briefly the utilization of template
shift of a newly synthesized DNA strand during in vitro DNA
synthesis in order to achieve DNA shuffling. By using this technique
it is possible to obtain such results in a more expedient manner,
and to some extent even a greater variation than in the above
mentioned methods.

The method of the invention is also very well suited for adaption to
automatisation.

In a preferred embodiment the technique is used in combination with
an error-prone polymerase thereby introducing an even greater
variation in the library created.

More specifically the present invention relates to a method for the
construction of a library of recombined homologous polynucleotides
from a number of different starting single or double stranded
parental DNA templates and primers by induced template shifts during
an in vitro polynucleotide synthesis using a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing parental double stranded DNA templates to produce
single stranded templates,

b) annealing said primers to the single stranded DNA templates,

c) extending said primers by initiating synthesis by use of said
polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended primers
from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended primers
from the templates and repeating steps A.b) to A.e) using said
extended primers produced in (A) as both primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number of
cycles of process steps A. and B.a), A. and B.b), or combinations
thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively amplify
homologous polynucleotides of interest.
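
As a rough illustration of how repeated cycles of steps A.b) to A.e) can produce a mosaic polynucleotide, the following Python sketch simulates a single growing strand. It is a toy model under stated assumptions, not the claimed procedure: the templates are taken to be pre-aligned strings of equal length, only one strand sense is modelled (complementarity is ignored), the primer is specific to the 5' end, and the extension per cycle is drawn at random. The names template_shift_extension, primer_len and max_step are hypothetical.

```python
import random


def template_shift_extension(templates, primer_len=5, max_step=6, seed=0):
    """Grow one strand by repeated anneal/extend/arrest/denature cycles.

    Toy assumptions: templates are pre-aligned, equal-length strings and the
    growing strand always re-anneals in register with whichever template it
    happens to meet in the next cycle.
    """
    lengths = {len(t) for t in templates}
    if len(lengths) != 1:
        raise ValueError("toy model needs pre-aligned templates of equal length")
    rng = random.Random(seed)
    full_length = lengths.pop()

    # The primer is modelled as the first primer_len positions of one template.
    source = rng.randrange(len(templates))
    strand = templates[source][:primer_len]
    origins = [source] * primer_len  # which template each position came from

    cycles = 0
    while len(strand) < full_length:
        cycles += 1
        # Step A.b): anneal to a randomly met template (possible template shift).
        source = rng.randrange(len(templates))
        # Steps A.c) and A.d): extend a few nucleotides, then synthesis arrests.
        step = rng.randint(1, max_step)
        end = min(len(strand) + step, full_length)
        origins.extend([source] * (end - len(strand)))
        strand += templates[source][len(strand):end]
        # Step A.e): denaturing frees the extended primer for the next cycle.
    return strand, origins, cycles


if __name__ == "__main__":
    parents = ["A" * 30, "C" * 30]  # two artificial, easily told apart parents
    product, origins, cycles = template_shift_extension(parents, seed=1)
    print(product, cycles)
```

Shorter extensions per cycle (a smaller max_step in this sketch) give more opportunities for a shift, which is the qualitative reason the text stresses control over extension length.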

In specific embodiments various modifications can be made in the
process of the invention. For example it is advantageous to apply a
defective polymerase, either an error-prone polymerase to introduce
mutations in comparison to the templates, or a polymerase that will
discontinue the polynucleotide synthesis prematurely to effect the
arrest of the reaction.

Further modifications will be described below.

In a further aspect the invention relates to a method of identifying
polypeptides exhibiting improved properties in comparison to
naturally occurring polypeptides of the same bioactivity, whereby a
library of recombined homologous polynucleotides produced by the
above process is cloned into an appropriate vector, said vector is
then transformed into a suitable host system, to be expressed into
the corresponding polypeptides and displayed, said polypeptides are
then screened in a suitable assay, and positive results selected.

In a still further aspect the invention relates to a method for
producing a polypeptide of interest as identified in the preceding
process, whereby a vector comprising a polynucleotide encoding said
polypeptide is transformed into a suitable host, said host is grown
to express said polypeptide, and the polypeptide recovered and
purified.

Finally, in further final aspects the invention relates to a
recombined/shuffled protein, which is obtainable by any of the
methods according to the invention, and which is a
recombined/shuffled protein comprising the sequences disclosed
herein (vide infra).

In those final aspects of the invention, the term "obtainable"
denotes that said protein is preferably obtained by a method
according to the invention. However a prior art known
recombination/shuffling technique such as those described in WO
95/22625 or WO 95/17413 may be used too, either alone or in
combination with a method according to the invention, in order to
obtain said recombined protein.

Accordingly, further final aspects of the invention are:



WO 98/41653                                          PCT/DK98/00104



a recombined/shuffled protease obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial sequences from at
least two different wild-type proteases;

a recombined/shuffled lipase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial wild-type sequences;

a recombined/shuffled Pseudomonas lipase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which at least contains two different partial sequences
from at least two of the different wild-type Pseudomonas lipases;

a recombined/shuffled xylanase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial sequences from at
least two different wild-type xylanases;

a recombined/shuffled cellulase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial sequences from at
least two different wild-type cellulases;

a recombined/shuffled amylase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial sequences from at
least two different wild-type amylases;

a recombined/shuffled laccase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial sequences from at
least two different wild-type laccases;

a recombined/shuffled phytase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which at least contains two different partial sequences from at
least two different wild-type phytases.

DEFINITIONS

Prior to discussing this invention in further detail, the following
terms will first be defined.

"Shuffling": The term "shuffling" means recombination of nucleotide
sequence fragment(s) between two or more homologous polynucleotides
resulting in output polynucleotides (i.e. polynucleotides having
been subjected to a shuffling cycle) having a number of nucleotide
fragments exchanged, in comparison to the input polynucleotides
(i.e. starting point homologous polynucleotides).

"Homology of DNA sequences or polynucleotides": In the present
context the degree of DNA sequence homology is determined as the
degree of identity between two sequences indicating a derivation of
the first sequence from the second. The homology may suitably be
determined by means of computer programs known in the art, such as
GAP provided in the GCG program package (Program Manual for the
Wisconsin Package, Version 8, August 1994, Genetics Computer Group,
575 Science Drive, Madison, Wisconsin, USA 53711) (Needleman, S.B.
and Wunsch, C.D., (1970), Journal of Molecular Biology, 48,
443-453).

"Homologous": The term "homologous" means that one single-stranded
nucleic acid sequence may hybridize to a complementary
single-stranded nucleic acid sequence. The degree of hybridization
may depend on a number of factors including the amount of identity
between the sequences and the hybridization conditions such as
temperature and salt concentration as discussed later (vide infra).






Using the computer program GAP (vide supra) with the following
settings for DNA sequence comparison: GAP creation penalty of 5.0
and GAP extension penalty of 0.3, it is in the present context
believed that two DNA sequences will be able to hybridize (using low
stringency hybridization conditions as defined below) if they
mutually exhibit a degree of identity preferably of at least 70%,
more preferably at least 80%, and even more preferably at least 85%.

"Heterologous": If two or more DNA sequences mutually exhibit a
degree of identity which is less than above specified, they are in
the present context said to be "heterologous".

"Hybridization": Suitable experimental conditions for determining if
two or more DNA sequences of interest do hybridize or not are herein
defined as hybridization at low stringency as described in detail
below.

Molecules to which the oligonucleotide probe hybridizes under these
conditions are detected using an x-ray film or a phosphoimager.

"Primer": The term "primer" used herein especially in connection
with a PCR reaction is an oligonucleotide (especially a
"PCR-primer") defined and constructed according to general standard
specifications known in the art ("PCR A practical approach" IRL
Press, (1991)).

"A primer directed to a sequence": The term "a primer directed to a
sequence" means that the primer (preferably to be used in a PCR
reaction) is constructed to exhibit at least 80% degree of sequence
identity to the sequence fragment of interest, more preferably at
least 90% degree of sequence identity to the sequence fragment of
interest, which said primer consequently is "directed to". The
primer is designed to specifically anneal at the sequence fragment
or region it is directed towards at a given temperature. Especially
identity at the 3' end of the primer is essential.

"Flanking": The term "flanking" used herein in connection with DNA
sequences comprised in a PCR-fragment means the outermost partial
sequences of the PCR-fragment, both in the 5' and 3' ends of the PCR
fragment.

"Polypeptide": Polymers of amino acids sometimes referred to as
proteins. The sequence of amino acids determines the folded
conformation that the polypeptide assumes, and this in turn
determines biological properties and activity. Some polypeptides
consist of a single polypeptide chain (monomeric), whereas others
comprise several associated polypeptides (multimeric). All enzymes
and antibodies are polypeptides.

"Enzyme": A protein capable of catalysing chemical reactions.
Specific types of enzymes to be mentioned are such as amylases,
proteases, carbohydrases, lipases, cellulases, oxidoreductases,
esterases, etc. Of specific interest in relation to the present
invention are enzymes used in detergents, such as proteases,
lipases, cellulases, amylases, etc.
DETAILED DESCRIPTION OF THE INVENTION

The present invention relates in its first aspect to a method for
the construction of a library of recombined homologous
polynucleotides from a number of different starting single or double
stranded parental DNA templates and primers by induced template
shifts during an in vitro polynucleotide synthesis using a
polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing a parental double stranded DNA template to produce
single stranded templates,

b) annealing said primers to the single stranded DNA templates,

c) extending said primers by initiating synthesis by use of said
polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended primers
from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended primers
from the templates and repeating steps A.b) to A.e) using said
extended primers produced in (A) as both primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number of
cycles of process steps A. and B.a), A. and B.b), or combinations
thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively amplify
homologous polynucleotides of interest.



According to the invention the polymerase may be a DNA or an RNA
polymerase; specific polymerases to be mentioned are such as DNA
polymerases like T4 polymerase, T7 polymerase, E. coli polymerase I
or the Klenow fragment of DNA polymerase I, or thermostable
polymerases such as Taq, Amplitaq, Vent, Pwo.

One of the advantages of the invention is that it makes it 
possible to control the length of the extension of the primers in 
the reaction in a convenient manner. 

This can be accomplished by various means such as choice of
polymerase, the physical and chemical conditions during the action
of the polymerase, e.g. pH, temperature, buffer, salt concentration,
and addition of various chemicals.

It is known that various polymerases carry out the DNA synthesis at
different rates (nucleotides incorporated per second). For example
the Klenow fragment of DNA polymerase I has a limited extension rate
compared to e.g. the Taq polymerase (Sambrook et al. 1989).

Polymerases also display differences in processivity, which is the
average number of nucleotides incorporated before the polymerase
dissociates from the template/extended primer; again the Klenow
polymerase is an example of a polymerase with limited processivity.

The choice of polymerase is therefore an important means in 
controlling the average extension of the primers. 

These conditions may also exert an influence on the fidelity of the
polymerase (the rate by which point mutations are introduced; HIV
reverse transcriptase is an example of a polymerase of low
fidelity), a parameter useful in combining shuffling and
mutagenesis.

In specific embodiments various modifications can be made in the
process of the invention. For example it is advantageous to apply a
defective polymerase, either an error-prone polymerase to introduce
mutations in comparison to the templates, or a polymerase that will
discontinue the polynucleotide synthesis prematurely to effect the
arrest of the reaction. Such a defective polymerase that could be
mentioned is a Klenow polymerase having low processivity.

In another embodiment of the invention polymerase will be added
after each cycle, if the polymerase used is not thermostable.

According to the invention the starting single or double stranded
parental templates may be different in that they contain different
point mutations in the same native polynucleotide (gene), or they
can be homologous polynucleotides (genes) isolated from nature,
which may be amplified by PCR, or they can be combinations thereof.
The templates used in the process of the invention are hereby
homologous showing an identity at the DNA level of e.g. more than
95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, or even more than 50%
identity.
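
To make the identity levels listed above concrete, the following Python sketch computes a percent identity for two sequences and checks it against a chosen level. It is only an illustration under stated assumptions: the sequences are assumed to be already aligned (gaps written as "-"), gap columns are skipped, and no attempt is made to reproduce the GAP program or its penalty settings referred to in the Definitions. The function names percent_identity and meets_identity_level are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the non-gap columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # assumption: columns containing a gap are not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * matches / compared


def meets_identity_level(aligned_a, aligned_b, minimum_percent):
    """True if the pair shows more than minimum_percent identity."""
    return percent_identity(aligned_a, aligned_b) > minimum_percent


if __name__ == "__main__":
    a = "ACGTACGTAC"
    b = "ACGTACGTTC"  # one mismatch in ten positions
    print(percent_identity(a, b), meets_identity_level(a, b, 85))
```

A real comparison would first align the templates, for example with a Needleman-Wunsch implementation, before counting identical positions.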

It may be advantageous to use pre-selected templates comprising
mutations with improved properties of interest. The present
recombination method of the invention will then recombine said
improved mutations for subsequent screening for even further
improvements in the properties of interest.

Said pre-selected templates with improved properties of interest may
have been identified by standard procedures in the art comprising
e.g. i) error-prone PCR of templates of interest followed by ii)
screening/selection for templates with improved characteristics of
interest. The mutagenesis frequency (low or high mutagenesis
frequency) of the error-prone PCR step is preferably adjusted in
relation to the subsequent screening capacity, i.e. if the screening
capacity is limited the error-prone PCR frequency is preferably low
(i.e. one to two mutations in each template) (see WO 92/18645 for
further details).
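
The relation between a per-base error rate and "one to two mutations in each template" can be sketched numerically. The Python fragment below is an illustration only and rests on assumptions not found in the text: mutations are treated as independent and Poisson-distributed along the template, and the template length of 1000 bp and target of 1.5 mutations are hypothetical values. The function names are likewise hypothetical.

```python
import math


def per_base_rate_for(target_mutations, template_length):
    """Per-base error rate giving the target mean number of mutations."""
    return target_mutations / template_length


def poisson_pmf(k, mean):
    """Probability of exactly k mutations under a Poisson model."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def fraction_with_one_or_two(mean):
    """Share of templates carrying exactly one or two mutations."""
    return poisson_pmf(1, mean) + poisson_pmf(2, mean)


if __name__ == "__main__":
    mean = 1.5          # hypothetical target, mutations per template
    length = 1000       # hypothetical template length in bp
    print(per_base_rate_for(mean, length))
    print(fraction_with_one_or_two(mean))
    print(poisson_pmf(0, mean))  # unmutated templates that still get screened
```

Under this model a noticeable share of the library remains unmutated even at the target mean, which is one reason the frequency has to be weighed against screening capacity.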

The arrest of the polymerase reaction in step A.d) may as indicated
above be obtained in different ways, such as by raising the
temperature, or adding specific reagents as described in WO
95/17413.

When raising the temperature for this purpose, it is preferred to
use temperatures between 90°C and 99°C.

When using chemical agents DMSO is a possibility. Appropriate
procedures are mentioned in e.g. WO 95/17413.

The process of the invention uses annealing of the primers to the
templates in step A.b). In this context the annealing may be random
or specific, meaning either anywhere on the polynucleotide or at a
specific position depending on the nature of the primer.

Also, the primers to be used may be completely random primers
(NNNNNNNNNNN) (N meaning a mixture of the four bases (A, T, G, C) is
used at a particular position in the primer during synthesis),
semi-random primers, or specific primers.
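
The difference between completely random, semi-random and specific primers can be shown with a small Python sketch. It is an illustration only: a primer design is written as a string in which N marks a position synthesized with a mixture of the four bases, and the helper names pool_size and sample_primer, as well as the example design "ACGNNT", are hypothetical.

```python
import random

BASES = "ATGC"


def pool_size(design):
    """Number of distinct sequences present in a primer pool.

    Each N position contributes a factor of four; fixed positions contribute one.
    """
    return 4 ** design.upper().count("N")


def sample_primer(design, rng):
    """Draw one concrete primer from the pool described by the design."""
    return "".join(rng.choice(BASES) if c == "N" else c for c in design.upper())


if __name__ == "__main__":
    rng = random.Random(0)
    for design in ("N" * 11, "ACGNNT", "ACGTTT"):  # random, semi-random, specific
        print(design, pool_size(design), sample_primer(design, rng))
```

A completely random 11-mer pool therefore contains 4^11 = 4,194,304 different sequences, whereas a specific primer is a pool of one.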

If the extended primers produced are to be separated from the
primers during the process it is convenient to use labeled templates
in order to provide a simple means for separation; a preferred
marker is biotin or digoxigenin.

According to the invention the number of cycles necessary will be
less than 500, in most cases less than 200, and normally less than
100 cycles.

In an embodiment of the invention the above in vitro shuffling is
combined with a subsequent in vivo shuffling by methods such as
those described in WO 97/07205.

In its second aspect the invention relates to a method of
identifying polypeptides exhibiting improved properties in
comparison to naturally occurring polypeptides of the same
bioactivity, whereby a library of recombined homologous
polynucleotides produced by the above process is cloned into an
appropriate vector, said vector is then transformed into a suitable
host system, to be expressed into the corresponding polypeptides,
said polypeptides are then screened in a suitable assay, and
positive polypeptides selected.

In an embodiment it is contemplated that the polypeptides of
interest encoded by the shuffled library are expressed as a suitable
fusion protein (e.g. as a hybrid with gIII of bacteriophage M13/fd)
in order to display said recombined polypeptide on the surface of
phage or bacteria.

In a third aspect the invention relates to a method for producing a
polypeptide of interest as identified in the preceding process,
whereby a vector comprising a polynucleotide encoding said
polypeptide is transformed into a suitable host, said host is grown
to express said polypeptide, and the polypeptide recovered and
purified.

The use of partial random (semi-random) or completely random primers
(mixtures of bases in a selected number or all positions in the
primer) as initiation point for the DNA synthesis provides certain
novel possibilities for the combined use of shuffling and random
mutagenesis.

It is often associated with difficulties to obtain an in vitro
recombination of polynucleotides that display relatively limited
homology. By the use of an embodiment of the invention even very
diverse polynucleotides can be forced to recombine.

According to that embodiment at least two templates (or two pools of
diverse templates) are applied. The novel synthesis of the one
polynucleotide can then be based on only one strand (i.e. either the
sense or the anti-sense strand), and the synthesis of the other
polynucleotide is based on the opposite strand.

This can be accomplished by isolating the complementary strands from
the two templates, e.g. by having these strands labeled by biotin.
Synthesis of DNA is initiated by annealing either specific, partly
random or completely random primers to these templates and adding a
suitable polymerase. This can be performed as either separate
reactions for the different templates or in just one reaction.
Synthesis should preferably be performed under conditions that favor
production of relatively short new fragments. These fragments can
subsequently be isolated from the templates based on the affinity
label. A PCR reaction is carried out on these fragments and as the
starting material originates from two different strands, the newly
synthesized fragments must recombine in order to produce full length
PCR products - a kind of forced recombination.

Also for this embodiment rapid PCR with short or no extension time
can be applied advantageously in order to enhance recombination,
especially if pools of templates are used for the two strands.

The length of the primer and the annealing temperature utilized in
the process determine if random primers will anneal and the number
of mismatches between the template and the primer that can be
accommodated. By varying the primer length and the annealing
temperature the method of the invention provides a means for
achieving random mutagenesis within a certain nucleotide window
representing the length of the primer. The method of the invention
thereby provides substantial benefits compared to other random
mutagenesis approaches, especially the high probability for several
base substitutions close to each other in the primary sequence, e.g.
the use of a completely random 20-mer (mixture of all four
nucleotides in all 20 positions) will according to theory under
given experimental conditions give a certain reasonably high
probability for having several base substitutions close to each
other.

Error prone PCR (= high mutagenesis frequency PCR) does not provide
this possibility. Error prone PCR provides a very low probability
for having more than one base substitution within one codon (coding
for one amino acid in a translated polypeptide).

Obviously the substitution of only one base within a codon doesn't
provide total random mutagenesis (at protein level) as only a
limited set of amino acid substitutions can be obtained by one base
substitution at DNA level (e.g. Methionine encoded by the ATG-codon
requires three base substitutions to become the TGT or TGC-codon
encoding Cysteine).
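
The two points made above, that a single base change reaches only a few amino acids and that a second hit in the same codon is improbable, can be checked with a short Python sketch. It uses the standard genetic code; the per-base substitution rate of 1% in the usage lines is a hypothetical figure chosen for illustration, substitutions are assumed to be independent, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def single_substitution_neighbours(codon):
    """Amino acids (or '*' for stop) reachable by exactly one base change."""
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base != codon[pos]:
                mutant = codon[:pos] + base + codon[pos + 1:]
                reachable.add(CODON_TABLE[mutant])
    reachable.discard(CODON_TABLE[codon])  # silent changes are not counted
    return reachable


def min_substitutions(codon, target_aa):
    """Fewest base changes turning codon into any codon for target_aa."""
    return min(
        sum(a != b for a, b in zip(codon, other))
        for other, aa in CODON_TABLE.items()
        if aa == target_aa
    )


def prob_two_or_more_hits(per_base_rate):
    """Chance that one codon receives at least two substitutions."""
    p = per_base_rate
    return 1 - (1 - p) ** 3 - 3 * p * (1 - p) ** 2


if __name__ == "__main__":
    print(sorted(single_substitution_neighbours("ATG")))
    print(min_substitutions("ATG", "C"))
    print(prob_two_or_more_hits(0.01))  # hypothetical 1% per-base rate
```

From ATG a single substitution reaches only Ile, Lys, Leu, Arg, Thr and Val, while Cys needs all three positions changed, in line with the example in the text.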

In one embodiment of the invention the process is therefore
performed by using random or semi-random primers having a length of
from 6 to 200 bp, preferably from 10 to 70 bp, and better from 15 to
40 bp.

One of the advantages in the method of the invention is the
robustness. In some embodiments the constant presence of full length
template provides a further advantage avoiding PCR contamination
problems. Furthermore it is much less laborious, with less hands on,
than other described methods, thereby providing excellent
possibilities for automation.

PCR-primers:

The PCR primers are constructed according to the standard
descriptions in the art. Normally they are 10-75 base-pairs (bp)
long. However, for the specific embodiment using random or
semi-random primers the length may be substantially longer as
indicated above.

PCR-reactions:

If not otherwise mentioned the PCR-reactions performed according
to the invention are performed according to standard protocols
known in the art.

The term "Isolation of fragment" is intended to cover something
as broad as simply an aliquot containing the PCR fragment.
However, preferably the PCR fragment is isolated to an extent
which removes surplus primers, nucleotides, templates etc.

In an embodiment of the invention the DNA fragment(s) is (are)
prepared under conditions resulting in a low, medium or high
random mutagenesis frequency.

To obtain low mutagenesis frequency the DNA sequence(s)
(comprising the DNA fragment(s)) may be prepared by a standard
PCR amplification method (US 4,683,202 or Saiki et al., (1988),
Science 239, 487-491).

A medium or high mutagenesis frequency may be obtained by
performing the PCR amplification under conditions which increase
the misincorporation of nucleotides, for instance as described by
Deshler, (1992), GATA 9(4), 103-106; Leung et al., (1989),
Technique, Vol. 1, No. 1, 11-15.

It is also contemplated according to the invention to combine
the PCR amplification (i.e. according to this embodiment also DNA
fragment mutation) with a mutagenesis step using a suitable
physical or chemical mutagenizing agent, e.g., one which induces
transitions, transversions, inversions, scrambling, deletions,
and/or insertions.

Expression of the recombinant protein from the recombinant
shuffled sequences

Expression of the recombinant protein encoded by the shuffled
sequence in accordance with the second and third aspect of the
present invention may be performed by use of standard expression
vectors and corresponding expression systems known in the art.



Screening and selection

In the context of the present invention the term "positive
polypeptide variants" means resulting polypeptide variants
possessing functional properties which have been improved in
comparison to the polypeptides producible from the corresponding
input DNA sequences. Examples of such improved properties can be
as different as e.g. enhanced or lowered biological activity,
increased wash performance, thermostability, oxidation stability,
substrate specificity, antibiotic resistance etc.

Consequently, the screening method to be used for identifying
positive variants depends on which property of the polypeptide
in question it is desired to change, and in what direction the
change is desired.

A number of suitable screening or selection systems to screen
or select for a desired biological activity are described in the
art. Examples are:

Strausberg et al. (Biotechnology 13: 669-673 (1995)) describes a
screening system for subtilisin variants having
calcium-independent stability;

Bryan et al. (Proteins 1:326-334 (1986)) describes a screening
assay for proteases having an enhanced thermal stability; and

PCT-DK96/00322 describes a screening assay for lipases having
improved wash performance in washing detergents.

An embodiment of the invention comprises screening or selection
of recombinant protein(s), wherein the desired biological
activity is performance in dish-wash or laundry detergents.
Examples of suitable dish-wash or laundry detergents are
disclosed in PCT-DK96/00322 and WO 95/30011.

If, for instance, the polypeptide in question is an enzyme and
the desired improved functional property is the wash performance,
the screening may conveniently be performed by use of a filter
assay based on the following principle:

The recombination host cell is incubated on a suitable medium
and under suitable conditions for the enzyme to be secreted, the
medium being provided with a double filter comprising a first
protein-binding filter and on top of that a second filter
exhibiting a low protein binding capability. The recombination
host cell is located on the second filter. Subsequent to the
incubation, the first filter comprising the enzyme secreted from
the recombination host cell is separated from the second filter
comprising said cells. The first filter is subjected to screening
for the desired enzymatic activity and the corresponding
microbial colonies present on the second filter are identified.

The filter used for binding the enzymatic activity may be any
protein binding filter e.g. nylon or nitrocellulose. The top
filter carrying the colonies of the expression organism may be
any filter that has no or low affinity for binding proteins e.g.
cellulose acetate or Durapore®. The filter may be pre-treated
with any of the conditions to be used for screening or may be
treated during the detection of enzymatic activity.

The enzymatic activity may be detected by a dye, fluorescence,
precipitation, pH indicator, IR-absorbance or any other known
technique for detection of enzymatic activity.

The detecting compound may be immobilized by any immobilizing
agent e.g. agarose, agar, gelatin, polyacrylamide, starch,
filter paper, cloth; or any combination of immobilizing agents.

If the improved functional property of the polypeptide is not
sufficiently good after one cycle of shuffling, the polypeptide
may be subjected to another cycle.

In an embodiment of the invention wherein homologous
polynucleotides representing a number of mutations of the same
gene are used as templates, at least one shuffling cycle is a
backcrossing cycle with the initially used DNA fragment, which
may be the wild-type DNA fragment. This eliminates non-essential
mutations. Non-essential mutations may also be eliminated by
using wild-type DNA fragments as the initially used input DNA
material.

Also contemplated to be within the invention are polypeptides
having biological activity such as insulin, ACTH, glucagon,
somatostatin, somatotropin, thymosin, parathyroid hormone,
pituitary hormones, somatomedin, erythropoietin, luteinizing
hormone, chorionic gonadotropin, hypothalamic releasing factors,
antidiuretic hormones, thyroid stimulating hormone, relaxin,
interferon, thrombopoietin (TPO) and prolactin.

A requirement to the starting parental DNA sequences, encoding
the polypeptide(s), to be shuffled, is that they are at least
50%, 60%, 70%, 80%, 90%, or 95% homologous. DNA sequences being
less homologous will have less inclination to interact and
recombine.
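One way to check such a requirement in practice is to compute the percent identity of two parental sequences. The sketch below assumes that "homologous" is measured as percent identity over a pairwise alignment that has already been computed (gaps written as "-"); the function names and the two example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue                # ignore columns that are gaps in both rows
        compared += 1
        if x == y:
            matches += 1            # a gap opposite a base counts as a mismatch
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, minimum_percent: float = 50.0) -> bool:
    """True if the pair satisfies a chosen minimum identity (50% by default)."""
    return percent_identity(aligned_a, aligned_b) >= minimum_percent


# Two hypothetical 20-base parental fragments differing at two positions.
parent_1 = "ATGGCTAAAGGTCTGACCGA"
parent_2 = "ATGGCTAAGGGTCTGACGGA"
print(percent_identity(parent_1, parent_2))
print(meets_threshold(parent_1, parent_2, minimum_percent=95.0))
```

The thresholds listed above (50% to 95%) can be passed as `minimum_percent`.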

It is also contemplated according to the invention to shuffle
parental polynucleotides that are homologous as indicated above
originating from wild type organisms of different genera.

Further, the starting parental templates to be shuffled may
preferably have a length of from about 50 bp to 20 kb, preferably
about 100 bp to 10 kb, more preferred about 200 bp to 7 kb,
especially about 400 bp to 2 kb.

The starting parental DNA sequences may be any DNA sequences
including wild-type DNA sequences, DNA sequences encoding
variants or mutants, or modifications thereof, such as extended
or elongated DNA sequences, and may also be the outcome of DNA
sequences having been subjected to one or more cycles of
shuffling (i.e. output DNA sequences) according to the method of
the invention or any other method (e.g. any of the methods
described in the prior art section).

When using the method of the invention the resulting recombined
homologous polynucleotides (i.e. shuffled DNA sequences) have
had a number of nucleotide fragments exchanged. This results in
replacement of at least one amino acid within the polypeptide
variant, if comparing it with the parent polypeptide. It is to be
understood that also silent exchanges are contemplated (i.e.
nucleotide exchanges which do not result in changes in the amino
acid sequence).

MATERIALS AND METHODS 

Specific Method Used in the Examples: 

A) DNA encoding different enzyme variants of the same gene or
different enzymes having the same type of activity encoded by
homologous genes are mixed. The DNA is provided as either PCR
fragments, plasmid, phage or genomic DNA.

B) The resulting pool of DNA is mixed with DNA polymerase,
dNTP, a suitable buffer and primers (being either random
oligomers (length of 6-30 nucleotides) or specific oligomers
(length of 6-50 nucleotides) or a combination of both types).

C) The PCR mixture is put into a PCR thermocycler (either cold
or hot) in a suitable tube.

c1) The thermocycler is heated to a temperature of 90-100°C
for a period of time (typically 1-10 min) in order to denature
the DNA templates.

c2) Thereafter the following procedure (cycle) is followed
(repeated): The template is denatured (typically 90-100°C for
0-5 minutes). Then the temperature is lowered (typically to a
value between 10°C and 90°C for 0-5 minutes) to allow annealing
of the primer to the single stranded template. Now the
temperature is raised again to denaturation temperature
(90-100°C), allowing small extensions of the primer to be
synthesized by the DNA polymerase during ramping. Alternatively
a short extension period (typically 0-30 seconds at 70-75°C) can
be introduced to allow larger extensions of the primers to be
generated. When the temperature reaches a value where
denaturation takes place, the extended primers and templates are
again separated. This procedure can be repeated (typically
between 1 and 99 cycles).
D) Having performed the desired number of cycles the generated
small DNA polymers can be purified from the oligomers used as
primers. One way is to isolate and clone a specific amplified
band containing the gene coding for the polypeptide of interest
into a suitable vector. This can be done either on agarose gel
(typically isolating fragments between 50 and 1000 base pairs),
by beads (using an affinity label on either templates or
primers) or through columns.

E) Then the purified (or the not purified) DNA polymers can be
assembled in a standard PCR reaction (for instance 94°C, 5
minutes, (94°C, 30 sec; 55°C, 30 sec; 72°C, 2 min) × 25, 72°C 5
minutes, 4°C).

Specific primers or DNA polymers generated by specific primers
can be added in order to generate a specific DNA polymer
containing the gene of interest. As mentioned in point D, this
DNA polymer can be purified and cloned into a vector of
interest.
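The effect of steps c2) to E) can be illustrated with a toy simulation, sketched here in Python. It is a deliberately simplified model and not the experimental procedure: it assumes pre-aligned parents of equal length, a growing strand that anneals to a randomly chosen parent in every cycle, and an extension of random length before synthesis is arrested. Annealing energetics, mismatches and polymerase errors are ignored, and all names are hypothetical.

```python
import random


def shuffle_by_template_switching(parents, max_extension, rng):
    """Toy model: build one strand from short extensions on randomly chosen parents.

    Returns (sequence, origin, cycles), where origin[i] is the index of the
    parent that position i was copied from.
    """
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("parents must be pre-aligned and of equal length")
    if max_extension < 1:
        raise ValueError("max_extension must be at least 1")

    strand, origin, cycles = [], [], 0
    while len(strand) < length:
        template = rng.randrange(len(parents))   # anneal to any parent
        step = rng.randint(1, max_extension)     # short extension, then arrest
        start = len(strand)
        end = min(length, start + step)
        strand.extend(parents[template][start:end])
        origin.extend([template] * (end - start))
        cycles += 1                              # denature, then next cycle
    return "".join(strand), origin, cycles


# Hypothetical 60-base "parents" made of a single letter each, so the origin
# of every position in the product is visible at a glance.
parents = ["A" * 60, "C" * 60, "G" * 60]
sequence, origin, cycles = shuffle_by_template_switching(
    parents, max_extension=10, rng=random.Random(1)
)
print(sequence)
print("cycles used:", cycles, "parents represented:", len(set(origin)))
```

In this model a shorter extension per cycle means more cycles and more opportunities for a template shift, which mirrors the role of the very short or zero-length extension step in the cycle described above.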



PCT/DK98/00104 
WO 98/41653 25 

Method 1:

The strong advantage of the method exemplified here is the
robustness and lack of PCR contamination problems, due to the
constant presence of parental template. Furthermore this method
is less labor demanding than methods described in the prior art,
thereby providing excellent possibilities for automation.

Nine different plasmids containing DNA sequences encoding 9
different variants of the H. lanuginosa lipase gene were mixed
in equimolar amounts. The variant genes contained from two to
seven mutations scattered throughout the gene.

The DNA sequence of the H. lanuginosa lipase gene and the amino
acid sequence of the lipase are disclosed in EP 0 305 216.

The variants are indicated according to standard terminology as
described in e.g. EP 0 396 603, and WO 97/07202.

The following 9 variant genes were shuffled:

1'. N94K+D96L+E99K
2'. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R
SPPRRP+A19C+C36A+N94K+D96L+Q249R
SCIRR+N94K+D96L+E239C+Q249R
D137G+D167G+E210V
7'. D96L+E99K+V187A
8'. SPPRRP+D57G+N94K+D96L+Q249R
9'. N94R+F95L



The following components were mixed in a microtube:

2 µl plasmid mixture [...], specific primers flanking the gene
(1 pmol/µl), [...] 10× Taq buffer (Perkin Elmer), 0.5 µl Taq
enzyme in a total volume of 20 µl.

The tube was set in a Perkin Elmer 2400 thermocycler. The
following PCR-program was run: (94°C, 5 minutes) 1 cycle;
(94°C, 30 seconds, 70°C, 0 seconds) 99 cycles; (72°C, 2 minutes,
4°C indefinite) 1 cycle.
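For planning purposes the program above can be written down as data and its programmed hold time added up. The sketch below is illustrative only: the `Step` and `Stage` types are hypothetical, the final 4°C hold is left out because it is indefinite, and ramping time between temperatures is not included even though, with a 0 second step at 70°C, primer extension takes place during ramping.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Step:
    temp_c: float      # target temperature in degrees Celsius
    seconds: float     # programmed hold time at that temperature


@dataclass(frozen=True)
class Stage:
    steps: Tuple[Step, ...]
    cycles: int = 1


# Method 1 program as stated in the text (final 4°C hold omitted).
METHOD_1 = (
    Stage((Step(94, 300),)),                        # 94°C, 5 minutes, 1 cycle
    Stage((Step(94, 30), Step(70, 0)), cycles=99),  # 94°C 30 s, 70°C 0 s, 99 cycles
    Stage((Step(72, 120),)),                        # 72°C, 2 minutes, 1 cycle
)


def programmed_hold_seconds(program) -> float:
    """Sum of hold times over all stages, excluding ramping between steps."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


total = programmed_hold_seconds(METHOD_1)
print(total, "seconds =", total / 60, "minutes of programmed holds")
```

The real run time is longer, since every one of the 99 cycles also includes ramping down to 70°C and back up to 94°C.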

The PCR-reaction was run on a 1.5% agarose gel. A DNA-band of
the specific expected size was cut out of the agarose gel and
purified using JETsorb (from GENOMED Inc.). The purified
PCR-product was cloned into a TA-vector (from Invitrogen (the
original TA cloning kit)). The ligated product was transformed
into a standard Escherichia coli strain (DH5α).

20 transformants were fully sequenced across the gene of
interest.

Result:

The following 20 variants were found:

1. D137G+D167G+E210V+Y213C
2. SPPRRP+D57G+N94K+D96L+Q249R
3. N94R+F95L
4. SPPRRP+D137G+D167G+E210V
5. N94K+D96L+E99K+V187A+T257I
6. D137G+D167G+E210V
7. N94K+D96L+E99K+V187A
8. D57G+N94R+F95L+Q249R
9. N94K+D96L+E99K+E210V
10. SPPRRP+A19C+C36A+N94K+D96L
11. N94R+F95L
12. D137G+D167G+E210V
13. N94K+D96L+Q249R
14. SPPRRP+Q15P+A19C+C36A+N94K+D96L
15. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R
16. D137G+D167G+E210V
17. SCIRR+N94K+D96L+Q249R
18. N94K+D96L+E99K
19. N94R+F95L
20. SPPRRP+N94R+F95L+F113S+Q249R

Nearly all mutations were represented (19 of 20), indicating
little bias for specific templates.



Statistics:

Not shuffled                              10
Shuffled between at least 2 templates      8
Shuffled between at least 3 templates      2



The shuffled sequences can then be subcloned from the E. coli TA
vector into the yeast vector pJS026 as a BamHI-XbaI fragment
(see WO 97/07205), and e.g. screened for new shuffled sequences
with improved performance in detergents (see WO 97/07205).



Method 2: 

PCR products of 10 different lipase variant genes were 
generated as above and pooled in equimolar amounts. 



The following 10 mutant genes were shuffled:

1'. D137G+D167G+E210V
2'. D96L+E99K+V187A
3'. N94K+D96L+E99K
4'. SPPRRP+D57G+N94K+D96L+Q249R
5'. D111N+F211A+G225P
6'. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R
7'. SPPRRP+A19C+C36A+N94K+D96L+Q249R
8'. STPRRP+N94R
9'. N94R+F95L
10'. SCIRR+N94K+D96L+E239C+Q249R

The following mixture was generated in a suitable tube:
1 µl PCR mixture (0.1 µg), decamer random primer (300 pmol),
2 µl 10× Klenow buffer (Promega), 0.25 mM dNTP, 2.5 mM MgCl2 in
a total volume of 20 µl.

The mixture was set in a PE2400 thermocycler where the following
program was run: 96°C, 5 minutes; 25°C, 5 minutes; 0.5 µl Klenow
enzyme was added; 25°C, 60 minutes; 35°C, 90 minutes.

This procedure generates a high number of small DNA polymers
originating from all parts of the gene.

10 µl was taken out for test on agarose gel.
10 µl PCR mixture (0.25 mM dNTP, 1 µl 10× Taq buffer (Perkin
Elmer), 2.5 mM MgCl2, 0.5 µl Taq enzyme) was added to the 10 µl
in the tube in the thermocycler. Then the following standard
PCR-program was run: (94°C, 5 minutes) 1 cycle, (94°C 53
seconds, 45°C, 30 seconds, 72°C 30 seconds) 25 cycles, 72°C 7
minutes, 4°C indefinite.
The PCR products were run on a 1.5% agarose gel. A clear
unbiased smear was seen. DNA between 400 and 800 bp was isolated
from the gel.

Half of the purified PCR product was mixed in a tube with two
specific primers (40 pmol) flanking the gene of interest,
0.25 mM dNTP, 2 µl 10× Taq buffer, 2.5 mM MgCl2. Then the
following standard PCR-program was run: (94°C, 5 minutes) 1
cycle, (94°C 30 seconds, 50°C, 30 seconds, 72°C 30 seconds) 25
cycles, 72°C 7 minutes, 4°C indefinite.

The PCR product was run on a 1.5% agarose gel. A specific
though weak band of the expected size was isolated. Additional
PCR was run using specific primers (as mentioned above) in order
to amplify the PCR-product before cloning.

The PCR-product and the desired vector were cut with the
appropriate restriction enzymes (BamHI/XhoI). The vector and the
PCR product were run on a 1.5% agarose gel, and purified from
the gel.

The cut PCR-product and the cut vector were mixed in a ligase
buffer with T4 DNA ligase (Promega). After overnight ligation at
16°C the mixture was transformed into E. coli strain DH5α.

19 clones were fully sequenced across the gene.

The following 19 variants were found:

1. STPRRP+N94R+N233R+D234R+Q249R
2. SPPRRP+N233R+D234R+Q249R
3. D96L+D167G+Q249R
4. N94R
5. D167G
6. SPPRRP+A19C+A28T+N94K+D96L+D111N+E239C
7. SPPRRP+A19C+C36A+N94K+D96L+Q249R
8. N94K+D96L
9. N25T+D57G+N94R+E99K+D167G+T231R+N233R+D234R+Q249R






10. N94R+Q249R 

11. D167G 

12. D167G 

13. T32I+N94R+F95L+D167G+Q249R 
14. E87K+N94K+D96L

15. N94R+F95L+Q249R 

16. N94K+D96L+D111N 

17. STPRRP+S17T 

18. N94K+D96L+V187A 

19. SPPRRP+D57G+N94K+D96L+D111N+L151S

All template variants were represented, indicating little bias
for specific templates.

There were no apparent hot spots with regard to mutation
exchange and it seems to be evenly distributed along the gene.

The shuffled sequences can then be subcloned from the E. coli TA
vector into the yeast vector pJS026 as a BamHI- fragment (see WO
97/07205), and e.g. screened for new shuffled sequences with
improved performance in detergents (see WO 97/07205).

Example 3:

Amylase variant shuffling:

In Example 1, it was shown how a number of multiple variants of
H. lanuginosa lipase were shuffled. In a similar manner,
variants of Bacillus α-amylases can be shuffled.

Earlier patent applications have identified variants of various
α-amylases from Bacillus species improved for particular
properties, e.g. thermostability, stability under
calcium-depleted conditions, improved wash-performance etc. (see
WO95/10603, WO96/23874, WO96/23873, and PCT/DK97/00197).

Variants of B. licheniformis α-amylase amyL can be shuffled as
follows. The variants are all located in the B. subtilis
expression vector pDN1528 described in WO95/10603.

The experiment is carried out under the exact same conditions
as Example 1 except that the flanking 27mer primers used to
initiate DNA synthesis were different.

The PCR amplified band of approximately 1500 bp is purified from
an agarose gel and cloned as described in Example 1. [...]
restriction sites located within the amyL gene can be utilized
to clone the library of shuffled genes into either the Bacillus
plasmid pDN1528 or an E. coli vector containing the wild type
amyL gene, e.g. [...] described in WO96/23874.

Example 4:

Shuffling of two genes encoding homologous α-amylases:
amyL and the amylase identified by SEQ.ID no. 2 (amino acid) and
SEQ.ID no. 5 (DNA) described in WO96/23873.

The forward strand (identical to the mRNA) of amyL can be
amplified in a PCR using standard conditions.

The forward strand is separated from the reverse strand based
on its affinity to streptavidin coated magnetic beads and
denaturation of the two strands with NaOH.
Similarly the reverse strand (complementary to mRNA) of the
amylase encoded by SEQ.ID no. 5 (WO96/23873) can be amplified
and isolated.

Two primer strands are used as templates in a PCR: (94°C, 5
minutes) + 99 cycles of (94°C, 30 seconds; 60°C, 0 seconds) +
(72°C, 5 minutes) using random primers of various lengths, Taq
polymerase and standard buffer conditions as described in
Example 1.

The resulting approximately 1500 bp product is cloned either by
TA cloning as described in Example 1 (for verification of the
sequence of the resulting clone) or into the Bacillus vector
pTVB110 utilizing SfiI and PstI restriction sites.

The original template can be removed from the PCR at any step
(e.g. after 5, 10 or 20 cycles) based on the biotin tag.

PATENT CLAIMS

1. A method for the construction of a library of recombined
homologous polynucleotides from a number of different starting
single or double stranded parental DNA templates and primers by
induced template shifts during an in vitro polynucleotide
synthesis using a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing a parental double stranded DNA template to
produce single stranded templates,

b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use
of said polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended
primers from the templates and repeating steps A.b) to A.e)
using said extended primers produced in (A) as both primers
and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number
of cycles of process steps A. and B.a), A. and B.b), or
combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.



2. The method of claim 1, wherein said polymerase is a
thermostable DNA polymerase, such as Taq, Amplitaq, Vent, Pwo.

3. The method of claim 1, wherein said polymerase is a DNA
polymerase, such as T4 polymerase, T7 polymerase, E. coli DNA
polymerase I or the Klenow fragment of DNA polymerase I, and
said polymerase is added to the reaction mixture after each
cycle.

4. The method of claim 1, wherein said polymerase is an RNA
polymerase.


5. The method of any of the claims 1 to 4, wherein said
polymerase is an error-prone polymerase, such as a reverse
transcriptase like HIV reverse transcriptase.

6. The method of any of the claims 1 to 5, wherein said
starting single or double stranded parental templates are
different in that they contain different point mutations in the
same native polynucleotide (gene), or are homologous
polynucleotides (genes) isolated from nature.


7. The method of claim 5, wherein said starting single or
double stranded parental templates exhibit an identity at the
DNA level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%,
60%, 55%, or even more than 50% identity.


8. The method of any of the claims 1 to 7, wherein the arrest
of the polymerase reaction in step A.d) is achieved by raising
the temperature.

9. The method of claim 8, wherein the temperature is raised to
a value between 90°C and 99°C, preferably 94°C.

10. The method of any of the claims 1 to 7, wherein the arrest
of the polymerase reaction in step A.d) is achieved by adding
specific reagents, such as DMSO.

11. The method of any of the claims 1 to 10, wherein said 
primers comprise a population of completely random primers. 

12. The method of any of the claims 1 to 10, wherein said
primers comprise a population of semi-random primers.

13. The method of any of the claims 1 to 10, wherein said
primers comprise a population of specific primers.

14. The method of any of the claims 1 to 13, wherein said
primers are labeled, preferably with biotin or digoxigenin.

15. The method of any of the claims 11, 12 or 14, wherein said
random or semi-random primers have a length of from 6 to 500 bp,
preferably from 15 to 300 bp, and better from 25 to 150 bp.

16. The method of any of the claims 13 or 14, wherein said
primers are 6 to 75 bp long.

17. The method of any of the claims 1 to 16, wherein said
starting parental DNA templates are at least 50%, 60%, 70%, 80%,
90%, or 95% homologous.

18. The method of claim 17, wherein said starting parental DNA
templates originate from wild type organisms of different
genera.

19. The method of any of the claims 17 or 18, wherein said
starting parental templates have a length of from about 50 bp
to 20 kb, preferably about 100 bp to 10 kb, more preferred
about 200 bp to 7 kb, especially about 400 bp to 2 kb.

20. The method of any of the claims 1 to 19, wherein said
starting parental templates are linear DNA fragments generated
by a PCR reaction.

21. The method of any of the claims 1 to 20, wherein said
starting parental templates are cloned into suitable vectors,
such as a plasmid.

22. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide encoding
an enzyme, such as a carbonyl hydrolase, carbohydrase, or
esterase, such as a protease, lipase, amylase, cellulase,
oxidase, oxidoreductase, etc.



23. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide encoding
a polypeptide having biological activity, such as insulin,
ACTH, glucagon, somatostatin, somatotropin, thymosin,
parathyroid hormone, pituitary hormones, somatomedin,
erythropoietin, luteinizing hormone, chorionic gonadotropin,
hypothalamic releasing factors, antidiuretic hormones, thyroid
stimulating hormone, relaxin, interferon, thrombopoietin (TPO)
and prolactin.



24. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide
containing a biological function such as transcription
initiation (promoter) or termination, translational initiation,
operator sites related to expression of genes.

25. The method of any of the claims 1 to 24, wherein said
number of cycles is less than 500, 200, or less than 100
cycles.

26. A method of identifying a polypeptide of interest
exhibiting improved properties in comparison to naturally
occurring or other known polypeptides of the same activity,
whereby a library of recombined homologous polynucleotides
produced by a process according to any of the claims 1 to 25 is
cloned into an appropriate vector, said vector is transformed
into a suitable host system, to be expressed into the
corresponding polypeptides and displayed, and said polypeptide
is screened in a suitable assay, and positive polypeptides
selected.

27. A method for producing a polypeptide of interest as
identified according to claim 26, whereby a vector comprising a
polynucleotide encoding said identified polypeptide is
transformed into a suitable host, said host is grown to express
said polypeptide, and the polypeptide recovered and purified.